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Abstract 

Many problems in quantum information theory can 
be vied as interconversion between resources. In this 
talk, we apply this view point to state estimation 
theory, motivated by the following observations. 

First, a monotone metric takes value between SLD 
and RLD Fisher metric. This is quite analogous to 
the fact that entanglement measures are sandwiched 
by distillable entanglement and entanglement cost. 
Second, SLD add RLD are mutually complement via 
purification of density matrices, but its operational 
meaning was not clear. 

To find a link between these observations, we define 
reverse estimation problem, or simulation of quan- 
tum state family by probability distribution family, 
proving that RLD Fisher metric is a solution to local 
reverse estimation problem of quantum state family 
with 1-dim parameter. This result gives new proofs 
of some known facts and proves one new fact about 
monotone distances. 

We also investigate information geometry of RLD, 
and reverse estimation theory of a multi-dimensional 
parameter family. 



1 Introduction 

Many problems in quantum/classical information 
theory can be viewed as interconversion between a 
given resource and a 'standard' resource, and such 
viewpoint had turned out to be very fruitful. This 
manuscript will exploit this scenario in asymptotic 
theory of quantum estimation theory (with some 
comments on classical estimation theory). 

Resource conversion scenario was first explored in 
axiomatic theory of entanglement measures. Entan- 
glement is a kind of quantum non-locality, which can- 
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not be explained by classical mechanical theory. Its 
effect is typically visible in so called maximally entan- 
gled states, which we regards as standard resources. 
The optimal asymptotic conversion ratio from maxi- 
mally entangled state to a given state is called entan- 
glement cost, while the optimal ratio for inverse con- 
version is called distillable entanglement. It is shown 
that all quantities which satisfies a set of reasonable 
axioms takes value between these two quantities. 

It had been known that a monotone metric in quan- 
tum state space takes value between RLD and SLD 
Fisher metric. It had been also pointed out that these 
metrics are mutually 'complement', in the following 
sense: A mixed quantum state can be regarded as a 
reduced state of pure state in larger system. In this 
view, SLD Fisher metric of a quantum state space 
equals RLD Fisher metric in the space of quantum 
states in ancilla system. 

In the manuscript, we link these two facts via re- 
source conversion scenario, giving new proof of the 
former, and monotonicity of RLD and SLD Fisher 
metrics. We also prove similar statement for quan- 
tum version of relative entropy, which, to author's 
knowledge, is new. In the discussion, estimation cor- 
responds to distillation of standard resource. 

In above discussion, we need 'reverse estimation', 
which we formulate as reverse estimation of quan- 
tum state families: Given a family of quantum state, 
its reverse estimation is a CQ map and a family of 
probability distributions such that the output of the 
CQ map equals the quantum state family (In fact, we 
mainly consider local version of this, to make a com- 
plement to local estimation theory, which is equiva- 
lent to asymptotic estimation theory). 

Next, we study local reverse estimation itself more 
in detail. Especially, we point out that, in general, 
local 'reverse estimation' is mathematically equiva- 
lent to local estimation with fixed set of observables. 
Straightforward calculation shows that optimal re- 
verse estimation corresponds to P-representation. 
For the Q-representation corresponds to optimal esti- 
mation, gap between reverse estimation and estima- 
tion origins from uncertainty contained in coherent 
states. 
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2 SLD and RLD, Fisher infor- 
mation 

In the manuscript, wc restrict ourselves to finite 
(namely, d- ) dimensional Hilbert space H, unless oth- 
erwise mentioned. The totality of density matrices is 
denoted by S (H) , and the totality rank r elements is 
denoted by Sr {H). In the manuscript, r = d, unless 
otherwise mentioned. Unless otherwise mentioned, 
a parameterized family of quantum states, often de- 
noted by = {pg; ^ G 6 C R™}, is assumed to be 
differentiable up to an arbitrary order. 

Define a symmetric logarithmic derivative Lg ^and 
a right logarithmic derivative L^^ as a solution to the 
matrix equation. 



where 5i := ^ 



If p0 is strictly positive, Lg^ and L^^ 
are uniquely defined in this way. If pe has zero eigen- 
values, . still can be defined, but not uniquely. 
Lg■^ exists (and if exists, unique) if and only if dipe 
has non-zero eigenvalues only in the support of pe- 
Observe they are quantum equivalences of a classical 
logarithmic derivative, di\ogp0{x). 

An SLD Fisher information matrix Jg and RLD 
Fisher information matrix are defined as 



tR 



mipeLl.Ll^, 
Tipeh'^^Lf,. 



They are quantum analog of a classical Fisher infor- 
mation matrix, 

■■= '^Pe{x)di \ogpe{x)dj logpe{x). 



For they are positive definite, they can be re- 
garded as metric tensors introduced to the tan- 
gent space Tg (^A ) , and the corresponding metrics 
are called SLD Fisher metric and RLD Fisher met- 
ric,respectively. 



Duality between SLD 
RLD, Reverse SLD 



and 



Denote by W the totality of matrices with finite num- 
bers of rows and columns. In the manuscript, an 
element of W is considered as an ordered set of un- 
normalized state vectors which forms a convex de- 
composition of a mixed state, with their magnitudes' 
corresponding to weights. Equivalently, an element 
of W can be interpreted as a representation of bi- 
partite pure state, whose reduced density matrix to 
one of the parties equals a given density matrix. The 
totality of dx d' elements of W is denoted by Wd' ■ 



We consider a map tt form W to <S (Ti), 

An interpretation of this map is as follows. Let 

W = [Vpi\<t>i) ,VPd^\M], 



then, 



d' 



AW) = Y^Pi \ct>i) {ct>i 



Another interpretation would be given by taking cor- 
respondence. 



I, J 



where {!(;;)} is an orthonormal basis in Ti. , and 
is an orthonormal basis in a Hilbert space H' for pu- 
rification. Then, 

niW) = Tvn' \^w) ■ 

Its differential map is denoted by tt* : T (W) i— !■ 
T {S (Ti.)), where Tw (W) means a tangent space to 
W at W, and T (W) is a tangent bundle, or the union 
of Tw (W), with W^s running all over W. An element 
of T (W) is naturally represented by an element of 
W by considering a parameterized family of an ele- 
ments W and differentiating with respect to a param- 
eter. Denote such representation of an element X of 
Tw (W) by MX , or more exphcitely. 



M 



d 



In that representation. 



w' 



w 



which is easily understood recalling Leibnitz's rule of 

differentiation of a product of two matrices. 

Observe that these maps are not unique. First, the 
map TT satisfies, 

Tr{WU) = Tr{W) 

where J7 is a matrix with UU"^ =1 (need not to be a 
unitary). Sometimes, this transform is refered to as a 
gauge transform. Correspondingly, the kernel of tt*, 
denoted by ICw (W), is 

)Cw (W) = {x ; MX = WA^, 3A^ = -A^^j . 

Denote an element of Tp{S{TL)) by X, and denote 
an SLD and RLD corresponding to X by x '^^^ 
L^^xi respectively. We define two inverse maps of 
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TT*, which are denoted by h^and hy^ (subscript W is 
often dropped) as, 



'■w 



with p = iriyV). It is easy to verify tt* 
TT* o = id. Consider subspaces CSw (W) and 
LUw (W) of Tw (W) which are defined by, 

CSw m = {x;X = h^{X), 3X e T,(,) {S (W))} , 
CUw {W) = [X;X = h^{X), 3X e T,(,) (5 (W))} 
It is easy to see 

A^ is said to be the reverse SLD at W. 

Define a map tt from W to S{Ti.') such that, 

n{W) = W^W = tt{W) = Tr„ \<^w) • 

Correspondingly, wc can define h^^{X), h^r{X), 
CSw (>V), and OZw (W), for which 

CSwiW) =) CTZwiW), 
aZw (W) D £5vK (W) 

holds. Especially, if d' = r, the LHS and the RHS 

coinside with each other. This means than RLD of 
the system corresponds to SLD of the ancilla system. 
Also, we have, 

TrpL^^L^ < Trw{W)A^A^. 

Especially, if d' = r, the equality holds. These rela- 
tions are called duality between SLD and RLD. 

4 Reverse estimation of quan- 
tum sate family and RLD 

The heart of quantum statistics is optimization of 

a measurement, i.e., choice of a measurement which 
converts a family of quantum states to the most infor- 
mative classical probability distribution family. Let 
us denote by pg'^ the probability distribution of mea- 
surement results of M applied to pe, and denote by 
Jg^ the classical Fisher information matrix of the 
probability distribution family {p^}- Then, it is 
known that, for a 1-dim quantum state family M, 

max = Ji, 

M:meas. 

or, Jg is the maximal amount of classical Fisher in- 
formation extracted from the 1-dim quantum state 



family {p0} at 9. In other words, we consider a QC 
map which maximizes the output Fisher information. 

Now, wc consider the reverse of above, i,c, emula- 
tion of the 1-dim quantum state family {pe} at 9o up 
to the first order, i.e., a pair {pe}) of the proba- 
bility distribution family {p$} such that with a QC 
channel such that. 



d* (pe) 



6=00 



Pea-, 
dpe 
dO 



(1) 



Our task is to optimize a pair (4>, {pe}), called local 
reverse estimation at Oq , to minimize Fisher informa- 
tion Je of the input {pe}- 

A local reverse estimation of {pe} at is con- 
structed as follows. Define a system of state vectors 
I'/'i) ) ■ ■ ■ ) \ 4'd') ) and a probability distribution {p{i)} 
by the equations, 

d' 

x=l 

This corresponds to a QC map $ which outputs \(j)x) 
according to the input probability probability distri- 
bution p{x). Define real numbers Ai, • • • , A^' by 



dpg 
d6 



d' 



x=l 



and define {pe} by pe (x) := p{x) + Xxp{x){9 - 9o). 
Then, the pair ($, {pe}) is a local reverse estimation, 
and any local reverse estimation is given in this way, 
essentially (i.e., modulo the difl[erence of o{\6 — 6o\)). 
Define also 



W = [v^|0i),--- 
A = diag(Ai,...,Ad')- 



Then , we have, ^ 



e=eo 



= WAWt, and 



Ll,iWP = WA, 

with P being the projector onto the support of tt {W) 
. The logarithmic derivative of {pe} a.t 6 = 6o is 



dlogpe{x) 



d9 



and its Fisher information is. 



— Ax, 



Jeo = ^(A,)'p(a^) =TrW^^^Wt 

x=l 

> TvWAPAW'' = J^. 
— So 

The equality holds if P equals the identity, or d' = r. 
Hence, to simulate {pe} at the neighbor of 9o up to 
the first order, we need classical Fisher information 
by the amount of Jg^ . 



Theorem 1 

max = Jjj, 

where maximization is taken over all the local reverse 
estimations of {p$} at 6q. 

5 Monotone metric revisited 

It is known that SLD Fisher metric and RLD Fisher 
metric are monotone by apphcation of CPT map, and 
any monotone metric takes value between SLD and 
RLD Fisher metric. In this section, we demonstrate 
operational meaning of SLD and RLD implies these 
properties in trivial manner. 

First, monotonieity of SLD is trivial because the 
optimization of measurement applied to the family 
{Ape} is equivalent to the optimization of measure- 
ment to {pg} over all the restricted class of measure- 
ment of the form M o A. 

The monotonieity of RLD Fisher metric is proven 
in the similar manner. Given a local reverse estima- 
tion ($, {pe}) of {pe} at (A o <t>, {pe}) is a local 
reverse estimation of the family {Ape} at ^o- We 
may be able to improve this reverse estimation to re- 
duce the amount of classical Fisher information of the 
probability distribution family. Thus the monotonie- 
ity of RLD Fisher metric is proved. 

Also, we can prove that SLD Fisher metric is no 
larger than RLD by considering composition of the 
optimal local reverse estimation followed by the opti- 
mal measurement. This operation, being a CPT map, 
cannot increase classical Fisher information. For the 
initial classical Fisher information equals RLD Fisher 
information and the final one equals SLD Fisher in- 
formation, we have the inequality. 

Assume that a metric is not increasing by a QC 
channel, and coincides with classical Fisher informa- 
tion restricted to classical probability distributions. 
Then, this metric slioidd be no smaller than SLD 
Fisher metric. Let us consider a 1-dim family {pe}- 
If one apply an optimal QC map, classical Fisher in- 
formation Jg of the output probability distribution 
family {pg} equals Jp^. Due to the latter assump- 
tion, = Jp^. Therefore, the monotonieity by a 
QC channel Qp^ > g^^ = Jf^. 

Similarly, assume that a metric is not increasing 
by a CQ channel and coincides with classical Fisher 
information restricted to classical probability distri- 
butions. Then, the metric should be no larger than 
RLD Fisher metric. Consider an optimal local reverse 
estimation of the 1-dim family {pg} at 9. Then, clas- 
sical Fisher information Jg of the input probability 
distribution family {pg {equals J^. Due to the latter 
assumption, Qp^ = J^. Therefore, the monotonieity 
by a CQ channel Qp^ < g^^ = J^. 



Altogther, if a metric is monotone non-increasing 
by application of QC and CQ maps, the metric takes 
value between SLD and RLD Fisher metric. 

Theorem 2 Assume that a metric g coincide with 

classical Fisher information in the space of classical 
probability distributions. In addition, if g is mono- 
tone decreasing by a QC map, g is larger than SLD 
Fisher metric. If g is monotone decreasing by a CQ 
map, g is smaller than RLD Fisher metric. 

6 Global reverse estimation 

Let us define a global reverse estimation of a quantum 
state family {pg}\s& pair ($, {pg}) of the probability 
distribution family {pg} such that with a QC channel 
such that, 

^(pe) = Pe,V^ee. 
This is equivalent to 

Pe = ^pe{i)\(l)i) {<t>i\ 

i 

= WoMgWl 

where Mg = diag(pg(x), • • • ,pg{x)). Let 

^e,i = dia.g{di\ogpg{x),--- ,dr\ogpg{x)), 
Wg = Wqs/Mq. 

For 1-dim restriction of achives RLD Fisher infoma- 
tion, we have to have, 

Ll^Wg = WgAl,. (2) 

Theorem 3 If pg is a full-rank matrix for all 6 G Q, 
the following three are equivalent. 

(i) The state family {pe} has a global reverse esti- 
mation such that its 1-dim restriction achieves 
RLD Fisher information at all G O. 

(ii) pg = Wg^MgWg^, where Mg is arxr Hermitian 
matrix, and [Mg^,Mg^\ = for all 6i, 62. 

(ill) [L^^, L^, .] = for all i, j, 9, and 9'. 

7 Two point reverse estimation 

Now, we turn to reverse estimation of two quantum 
states, p, (7, which is a pair {p\; X = p, a}) of a CQ 
map and a probability distribution family such that 
^{Pp) = P and ^{pa) = o- . The problem discussed 
here is the minimization of the divergence between 
the probability distributions between pp and p^. 



A 



It is known that the divergence equals a integral of 



metric along a curve, {p, 



(m) 



Jo 



tPp + (1 - t)P<y}, 



where Jf is a Fisher information of the family 
{pI™'}. This quantity is upper-bounded by 



D'^XrWo) /' f jf dsAt, 
Jo Jo 



(3) 



where the RLD Fisher information of the family 
of quantum states Observe that, for any 

reverse estimation, we have 



Pi 



(m) 



Hence, D {pp\\pa-) is maximized if J^™^ is maximized 
at each t. If j/^ = J^^"^ for all t {0 < t < 1), i.e., the 
reverse estimation is an optimal local reverse estima- 
tion at all i, i.e, a minimal reverse estimation, the 
reverse estimation should be optimal. In the proof, 
a key point was that image of m-affine curve is also 
m-affine. 

The integration Q is computed by Hayashi: 



D^{p\\a) = Tiplog p^a-^p^. 

8 Monotone Divergence 



(4) 



Let D'^{p\ \(t) be a quantity which coincides with clas- 
sical divergence in the space of probability distribu- 
tions, non-increasing by application of a CPT map, 
and is additive, 

DQ{pi(E)p2\\(Ji(E)a2) - D^ipiWai) + /^^(palka). 

Then, in the almost the same way as monotone met- 
ric, we can conclude such quantity is upper-bounded 
by D^{p\\(t), and lower-bounded by 



Dip\\a) 



^Trp(logp- logcr) . 



Assume that D'^{p\\a) is monotone by a QC map, 
coincide with the classical divergence for the proba- 
bility distributions, and is additive. It is known that 
there is a QC map such that the output probabil- 
ity distributions and p^^„ satisfies, D{p\\(t) = 
^^(PpLl&)+o(l). This implies 



Dip\\a) = -D^ipXlA 



< 



n 

D^ipWa 



pes 



||a«") + o(l) 
o(l). 



Here, tending n oo, we have D{p\\a) < D'^{p\\a) 
(This part is done by Hayashi). 

On the other hand, assume that D^{p\\a) is non- 
increasing by a CQ map, coincide with the classical 
divergence for the probability distributions. Then, 
letting (<&, {p\}) be an optimal reverse estimation, 

D^{p\\a) = D{p,\\p,)^DQ{p,\\p,) 
> D^ipWa). 

Theorem 4 Assume that D'^{p\\a) coincides with 
classical divergence for the probability distributions. 
In addition, if D'^(p\\a) is additive and non- 
increasing by a QC map, 

Dip\\a)<DQip\\a). 

On the other hand, if D'^{p\\a) is non-increasing by 
a CQ map, 

DQip\\a)<D"ip\\a). 

Can additivity assumption decrease the upper 
bound to the monotone divergences ? This cannot 
be true, for D^{p\\a) is additive. On the other hand, 
if we remove the additivity assumption, the lower- 
bound can be increased. 



9 Local reverse estimation of a 
multi-dimensional family 

A local reverse estimation can be recasted as follows. 
Under the constraint of 



Af = UAf'U\ Af', Af 



= ii,j^l,---,m), 



we minimize 



J2G.,TT^pUAf^A^ 
= J2G^J^TU^pUA^A 



(5) 



^,3 



where U is an isometry from H' to H, with dim 7^' > 
diniTi. 

Here, note the analogy of this with the local state 
estimation, which gives same result as the first order 
asymptotic theory. Assume we measure set of observ- 
ables A"* to estimate 0% i.e., A* — X^k^k-^k' where 
1^^; k| is an estimate of 0*, and {Af^} is a POVM for 

a measurement used for the estimation. Then, due to 
Naimark extension, we can find a set of observables 
A* (i = 1,2, • • • ,to) with 

A* = C/A»^7^ \x\xA^O = l,---,m). 



and 



Hence, establishing correspondence between X^ and 
Af, our target function corresponds to the 
weighted sum of the 'mean squared error' with the 
fixed set of observables. In other words, the problem 
is reduced to optimization of measurement in quan- 
tum estimation with the constraint X' = 

In particular, consider asymptotic exact reverse es- 
timation with corrective operation, i.e., the minimiza- 
tion of 

liminf - y G^.TrC/V^^L^If^'Af", 



with the constraint 



— „(8n 



Define 



Xl 



= -{pT) ^^pT {pTT'' ■ 

I " 

- y /«)•••«) (g) /•••«) /. 

II ^ — ^ 



fe=l 



Then, our target function is 

min lim inf n V Gi.TTiU'^ p^'^U'xfxf , 



2J 



where U runs over all isometry such that 



X^, X" 



0, = 



This corresponds to the asymptotic lower bound to 
the weighted sum of mean square error of corrective 
measurements. Hence, the minimum is given using 
so-called Holevo bound. For we have 

TrpAf = Jj^, 

due to Holevo bound, we have 

min lim inf n y G,„ TrJ7 V®" J/^fxf 



SpGSRJg^ + SpabsGSJg^, 
min {SpGJ; J>J^}. 



(6) 



classical Fisher information than the real part of 
RLD. 

On the other hand, if the given state family is D- 
invariant in Holevo's sense, the bound corresponding 
to the estimation is given, 

max {SpGJ; J< Je^} 
= SpG^J^ - SpabsGS Jg^, 

and the bound is achievable. This is smaller than the 
reverse estimation bound by SpabsG5J^. 

Example 5 (Gaussian state family) A Gaussian 
state family is defined by 



dpdq 

2'K(Jp(Jq 



This definition itsefl gives a global reverse estimation 
such that the coherent state \p, q) is according to the 
Gaussian distribution with the variance and the 
mean 6 = (6^,6'^). Its input Fisher information is 
J = cr^'^I, and 

SpJ = 2a-^. 



This in fact is optimal: 

SpSftJg^ -1- SpabsQ Jg^ 

where 



2a- 



1 



2 , n 
^ 2 
ih/2 + 



CT' + f -ih/2 



(a2 + h) a2 
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Note 



-Trpe [Lf , Lf ] , 



and this quantity is a measure of non-commutativity 
of RLD's. If this quantity is larger, we need more 



